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Abstract 

CO , We revisit the probability that any two consecutive events in a Pois- 

TTJ ' son process A'^ on [0, t] are separated by a time interval which is greater 

^\ . than s {< t) (a particular scan statistic probability), and the closely re- 

I ' ^ lated probability (recently introduced by Todinov [5], who denotes it as 

f~^ ' Pmffop) that before any event of A'' in [0, t] there exists an event-free 

^^ . interval greater than s. Both probabilities admit simple explicit expres- 

sions, which, however, become intractable for very large values oit/s. Our 
main objective is to demonstrate that these probabilities can be approx- 
^^ , imated extremely well for large values of t/s by some very tractable and 

?—( ' attractive expressions (actually, already for t larger than a few multiples 

=^- of.). 

1 Introduction 

Throughout this paper, N — {Nt : i > 0} denotes a Poisson process with rate 
A > 0. For any s > and t > 0, wc define 

P(7V, <2)(=e-^*(l + Ai)) , 0<t<s; 

(s,<;A) = { 

P{ma.Xo<r<t- s[Nr+s - Nr] <2) , t> s. 



By convention, ip{s,t;\) = 1 if i = 0. That is, (j){s,t;X), for s, i > 0, is the 
probabihty that N has no two consecutive events in [0,i] separated by a time 
interval which is shorter than s. Its complement is denoted by 

(j){s,t;\) := l-(f){s,t;X), 

and gives us the distribution function of the random variable t(s; A) defined to 
be the waiting time until two events of N occur within s time units; namely, 

P(r(s;A) <f) = 0(s,i;A), t > 0. (1.1) 

As indicated in the abstract, the probability (j){s,t]X) is a particular scan 
statistic probability. Indeed, the random variable Yn{s, t) defined by Ym{s, t) = 
maxo<r<t-s[-^r+s — Nr] is the scan statistic of 'window size' s for the Poisson 
process N on [0, t]. The distribution of Yn{s, t) has been extensively studied in 
the literature. However, the particular scan statistic probability P(Yiv(s, t) < 2) 
seems to have received little attention in the literature relative to its importance 
(and simplicity). Yet, as the author discovered after most of this research was 
completed, the fundamental result of this paper has been essentially identified in 
the (scan statistic) literature (Gates and Westcott [3]); nevertheless, our overall 
presentation of the result has several advantages and novelties. This point will 
be discussed in some detail in Appendix A, thus highlighting the contribution 
of our paper. It should also be stressed here that approximations to the general 
scan statistic probability P(YAr(s,i) < k), k > 2, which are readily available in 
the literature, are usually not satisfactory when applied to our case of interest, 
namely, fc = 2. A familiar and relatively successful one is due to Naus [5]; 
however, it is far too inferior compared to our results — both from the accuracy 
point of view, and from the simplicity one (see also Appendix A). 

Closely related to the probability 0(s, t; A) is the probability that before any 
event of TV in [0, t] there exists an event-free interval greater than s. We shall de- 
note it by (/)2(s, t; A). This probability has recently been introduced and studied 
by Todinov [8] , |9] , [1^ (to cite a few) in the context of reliability engineering 
(sometimes in a more general setting), where it is denoted as pmffop- The 
acronym MFFOP stands for "Minimum failure-free operating period" ; thus, in 
Todinov's framework, the events of N are interpreted as "failures" . 

It follows straight from the definitions that 

(/)2(s,i;A) =e"^'*</«(s,i-s;A), t > s. (1.2) 

Indeed, the event associated with the probability (p2{s,t;X) can occur if and 
only if A^s =0 and N has no two consecutive events in [s, t] separated by a 



time interval which is shorter than s. Relation (|1.2p allows us to focus on the 
probability (t>{s,t; A) (which is more interesting, in our view). 

The probability 0(s,i; A), for s, i > 0, admits the following expression: 

cj,{s,t;X)=e'^'h+J2— ^i ^1 (1-3) 

(Todinov [5", Eq. (17)], % Eq. (4)], [TOl Eq. (15.4)]), where [•] denotes the 
ceiling function. For completeness of exposition and convenience of the reader, 
we give here two full proofs of this simple result. 
Writing the right-hand side of (|1.3p as 



where l(-) denotes the indicator function, and noting that the greatest possible 
minimum distance between fc > 2 points in [0,t] is t/{k — 1), it follows by the 
law of total probability (conditioning on the number of events of A^ in [0,t]) that 
(|1.3p is true if (1 — (fc — l)s/t)'^ is the probability that the minimum distance 
between fc > 2 i.i.d. uniform random variables on [0,t] is greater than s. The 
latter is a well-known fact; however, we take this opportunity to prove the 
following more general fact (Todinov (H Eq. (7)]). 

Let f/(i), . . . , U(^n) be n order statistics from a uniform distribution on [0, i], 
and si_2, • • • , Sn-i,n nonnegative numbers with X]"=i •Sj.i+i < t- Then 



(l-iE^M+ij • 



P([/(,;+i)-C/(,) >Sj,j+i Vi = l,...,n-l) = ll - -2^Si,,+i I . (1.4) 

The proof we now give is a rigorous justification of Todinov's explanation in 
[3 Section 3.1], and a short, straightforward alternative to his derivation in [5]. 
Define 

A = {(-ui, . . . , u„) e [0, i]" : Ui+i -Ui > s^a+i Vi = 1, . . . , n - 1} 

and 

B = {{ui,U2- Si,2,.-.,Un- {Si,2^ h s„_i.„)) : (ui , . . . , u„) G A} 

(1.5) 

= {(Wl, ■^2, . ■ . ,Vn) ■■ < Vi < ■ ■ ■ < Vn < t - (si_2 H h S„-l,„)}. 

Recall that the joint distribution of the order statistics in (|1.4[) is given by 
P(C/(i) e dwi, . . . , f/(„) e dun) — nl/t"- dui ■ ■ ■ dun, < ui < • • • < u„ < t. 
Thus, the left-hand side of ()1.4p is equal to ?i!f^" J^ dui ■ ■ ■ du„, and hence also 



to n\t~" Jg dui ■ ■ ■ dun since, as is clear from the first equality in (|1.5p . A and B 
have the same Lebesgue measure. On the other hand, by the second equality in 
(|1.5I) . n\t~" J^ dui ■ ■ ■ dun is the probability of {[/(„) <t — X]"=i Si.i+i}- Thus, 
p. 41) is established. 

We now give an alternative, original proof of (|1.3p . The 'base case' \t/s] — 1 
(i.e., < i < s) holds by definition. Assume that t > s. By conditioning on the 
time of the first event of N, we obtain 

pt — S 

(j){s,t;X)^ / e^^"<?!)(s,i-(u + s);A)Ae"^"du 
Jq 

+ f e-^(*-"Ue~^"dM 

Jt-s 

/oo 
Ae"^" du, 

leading to 

' -^*(1 + At) , <i< s; 

is,t;X)^{ '^ ' ^, - ' (1.6) 



e 



-At 



(1 + As) + Jl Ae^^"(?!)(s, i - u; A) dw ,t>s. 



The rest of the proof is by induction on \t/ s\ = n. Suppose that (|1.3p holds for 
n, and let \t/ s\ = n + 1. Making the change of variable i — m i— >■ u in (|1.6p and 
then applying the induction hypothesis, we obtain 

pt — S 

(j>{s, t; A) = e-^*(l + As) + e"^* / Ae^"</>(s, u; A) du 

Jo 






-(l + As)+e-- / A( 1+ y: ^>^^^1M! )d. 



e-^* 



(l + AO+e-Atr ^>^^^^^d. 



(1.7) 



Thus, (|1.3p is established. 

Remark 1.1. It is worth noting that depends on s, t, and A only through t/s 
and As. In particular, we have that 4>{s,t; A) = 0(1, i/s; As). 

Taking into account the convolution theorem for Laplace transforms, Eq. 
(|1.6p invites us to calculate the Laplace transform of 0(s, i; A) as a function of t 
(for s fixed). Anticipating Theorem 12.21 below, we note that, based on (|1.3p . it 



can be shown that the equahty 

e-"V(g,^;A)d^ = ^ + ^- I -iu+x)s (1-8) 

holds for aU u > Uc ■= — A[l — B{Xs)], where B{Xs) is the solution B e (0, 1) 
of e"'*'*'^ — B, while for u < Uc the integral diverges. This number Uc is 
thus the abscissa of convergence of the Laplace transform of (f){s,t;X); it is, 
trivially, also the abscissa of convergence of the Laplace transform of Ae^"*, 
where A = A{X, s) > is a constant with respect to t. Theorem 12.21 gives the 
precise relation between the functions (f>{s,t; X) and e"'^*. Actually, the origin of 
Theorem 12.21 lies in the following remark. 

Remark 1.2. It follows from either (|1.3I) or the first equality in (II. 7p that 

d 

— 0(s, t; A) = -A(/>(s, i; A) + Ae^^^(?!)(s, i - s; A), t> s. 

With the above notation, we note that this differential equation is also satisfied 
by the function Ae^"*. 

Recall (|1.2p . Then, from (11.31) it follows immediately that, for s, i > 0, 



02(.,i;A)=e-^*('i+^'x:'^^^V^)' (^-^^ 

which is Todinov's formula, given e.g. in 8, Eq. (16)], [SJ Eq. (3)], and \W, Eq. 
(15.2)]. (However, relation (|1.2p has not been observed in Todinov's works.) 
Just as in Remark ITTTl we have that (p2{s,t;X) = (j)2{l,t/s]Xs). 

To conclude the introduction, we note the following simple facts. 

Let fi, ^2, ■ • • be the interarrival times of N, and set Y = inf{i : ^i < s}. 
Then, F is a geometric random variable with P(Y = i) = e^'^*('^^^(l — e~'^*), 
and the following equality in distribution holds: 

Y Y-l 

r2(s;A):=5]e. ' E(^ + ^~^) + C. (l-W) 



d 



where ^i (just as ^i) are independent exponentials with mean 1/A, independent 
also of Y and of the random variable ( with density (1 — e~'*'*)~^Ae~'''^, < 
X < s. The random variable T2 is related to the probability 02 by 

P(t2(s;A) <i) = l-<?!)2(s,i;A), t>0. 

Using the decomposition on the right-hand side of (ll.lOp . one can show that 

E[r2(s;A)] = -— ^^3- and Var[T2(s; A)] = 



A(l-e-^'^) ." L zv"> vj ^2(i_e-A.)2- 



(Actually, the formula for the expectation follows immediately from Wald's 
identity, applied to T2(s; A) = X]i=i fi •) Then, from the obvious relation 

t{s;X) ^£, + T2{s;X), 

where ^ is exponential with mean 1/A, independent of T2(s; A), we immediately 
obtain the expectation and variance of t{s; A). 



2 Main results 

Although (|1.3I) gives us a neat result, it may not be effective for very large values 
of t/s. This point will be discussed in the next section. Our first main result, 
Theorem 12. II below, gives an asymptotic power series expansion for 0(s,i; A) as 
s 4, 0. Before we state and prove it, we introduce the following polynomials, 
defined for a; > 0: 

e X 



^"(^) = E-e:^[^ + ("-!)]" 



fc=0 



kl 



El (--i)""'E^(^'^v, 



i=0 ^ ^ i=0 



where S{i,j) denote the Stirling numbers of the second kind and where we have 
used the well-known identity 

fc=0 ' j=0 

expressing the moments of the Poisson distribution with mean x in terms of 
Stirling numbers of the second kind (see e.g. Comtet [H p. 160]). The S{i,j) 
can be calculated recursively as follows (see e.g. Comtet '^, p. 208]): 

S{i,j) = S{i - l,j - l)+jSii - l,j) for 2 < J < z, 

with S{i,l) = S{i,i) = 1 for i > 1. In addition, S{i,0) := l{o}(«)- The first 
eight polynomials P„(x) are listed in Table [T] 

Theorem 2.1. Fixt>0. Then, 

0(s,t;A)-y (-l)"P„(At)^ assiO. 
That is, for each N — Q,\,2, . . ., 

N 



\n n 

s,i;A)-y (-l)"P„(At)^-— = 0(s^+i) assiO. (2.1) 



n^O 



Table 1: The first eight polynomials P„(a;). 



n 


Pnix) 





1 




1 


X 




2 


l + 3x + x^ 




3 


8 + 19a; + 9x'^ + x^ 




4 


81 + 175a; + 97a;2 + ISa;^ + x'^ 




5 


1024 + 2101a; + 1275a;2 + SOSx^ + SOa;"* + x^ 




6 


15625 + 31031a; + IQOSlx^ + 5590a;3 + 740^^ + 45a;5 + a;^ 




7 


279936 + 543607a; + 365001a;2 + 113701x3 + 18200a;4 + 1526a;5 + 63a;6 


+ x' 




Proof. We will show that ([STT]) holds for aU N = 1,3,5,..., the case N = 
0, 2, 4, . . . being analogous. Fix t > and iV G {1, 3, 5, . . .}, and let K := \t/s] 
be arbitrarily large. It follows from (II. 3p . using a simple property of the binomial 
expansion, that 4i{s,t; A) is bounded from above by 

I fe=2 ■ fc=JV+2 ' 1=0 

Collecting powers of s, it follows straightforwardly that 

E(-ir^^»(A^)- E ^^[^ + ('-1)] 

n=0 ' I k=K-n+l 

It is easy to show that the second term in the curly brackets is ols"^) as s | 0, for 
any r > 0. We conclude that the left-hand side of (12. ip is bounded from above 
by |0(s^+^)| as s I 0. The analogue of (j2.2p for a lower bound is obtained by 
replacing A^ + 1 by TV and A^ + 2 by A^ + 1; continuing as in the upper-bound 
case, we conclude that the left-hand side of (|2.1I) is bounded from below by 
o{s^) as s I 0, for any r > 0. Thus (12.11) is proved (for N odd, and analogously 
for N even). ■ 



Remark 2.1. From the proof of Theorem l2.11 we can obtain the following result: 

,2 

4>{s,t;X)^e^ a.s s 10, t -^ oo, st ^ c, (2.4) 

for c> fixed. Indeed, with s and t as in dH]), F„(A^)A"s" -^ (A^c)" (for ah 
fixed n) and, as is easy to show, the second term in the curly brackets of 



tends to 0. It is straightforward to verify that this result is consistent with the 
approximation (|2.16p . below. 

Remark 2.2. Similarly to Remark |2. 11 from the proof of Theorem 12.11 we can 
obtain the following result: 

(/)(s, i; A) -> e"''* as s | 0, A ^ oo, A^s -^ c, (2.5) 

for c > fixed. Indeed, with s and A as in 1^, P„(Ai)A"s" -^ (ci)" (for ah 
fixed n) and, as is easy to show, the second term in the curly brackets of p.3p 
tends to 0. (Actually, (|2.5p can be deduced from (|2.4I) by applying the relation 
4>{s,t; A) = 0(1, i/s; As).) Again, it is straightforward to verify that this result 
is consistent with the approximation (|2.16p . below. Now, recalling p. II) . the 
following statement is immediate. With s and A as in (|2.5p . the waiting time 
until two events of N occur within s time units converges in distribution to an 
exponential random variable with mean 1/c. 

Our second main result, Theorem l2.2l below. provides a tractable asymptotic 
expression for 4'{s,t] A) as i — >■ cxi. As we shall confirm numerically in the next 
section, this expression turns out to be an extremely successful approximation 
to (/)(s, t; A) for large values of t/ s in general — far more successful than one would 
expect a priori; surprisingly, it is even superior to its counterpart ()2.ip for the 
case when t is fixed and s is small. 

We first introduce some more notation. We let 1^9(5, i; A) denote the function 
defined by (p(s,i;A) = e^*ip{s,t;X), s > 0, i > 0. By Remark 1 1.11 the function 
ip{s, t] A) is a function of x := As and r ;= t/s; we denote this function of x and 
r by (p{x, r). Specifically, for any x > and r > 0, 

^{x,r) = \ + Y.— ^! —■ (2-6) 

fc=i 

We further denote by B{x) the function B : (0,oo) — > (0,1) defined as the 
solution of the implicit equation 

eyiY>[—xB{x)] ~ B{x), 

and by A{x) the function defined by 

^^""^ = B{x)[l + xBix)Y ^^-^^ 

The functions A{x) and B{x) are plotted in Figure [T] Finally, we let A{x,r) 
denote the function defined by 

Aix,r)= f^;\, . (2.8) 

exp[xB{x)r\ 
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Figure 1: The functions A{x) and B{x) 



Thus, 0(s, i; A) can be written as 

As is customary, f ^ g means that f /g — > 1. 

Theorem 2.2. The following convergence holds for any x > 0: 

A{x, r) — > A{x) as r — >■ oo. 

Hence, for any fixed s > 0, 

4>{s, t- A) - A(As)c-^[i-^(^")l* as t ^ oo. 



(2.9) 



(2.10) 



(2.11) 



Proof. To prove the theorem, we will use a well-known result from analysis, the 
Wicner-Ikehara theorem (see Appendix B). In order to apply it, we first rewrite 
(PHT]) as 

^'"''fi^^^rn^)^ Mx) asr-^cx3. (2.12) 

Next, we have to calculate the Laplace-Stieltjes transform of f{x, ^gi^\ ) as a 
function of r (for x fixed). We denote it by /(z; x), where z = u-\-iv with u > 1. 
We claim that 

1 



f{z-x)^l + 



zB[x) — exp[— za;-B(a;)] 



u > 1. 



(2.13) 



As a first step, let us justify the following equality: 



e ^Xx, 



xB{x) 



)dr, 



JQ- JQ 

under the assumption that the right-hand side exists if u > 1. Integrating by 
parts, we have 



M rM 



d^(-' ^^) = e-^*^ ^(-, ^) + ^ X '"'^^'^' ^^^ ^" 



^-zM 



(f{x, ^rgT^y) — !> as M -> oo. This is easily 



We thus have to show that e 
seen to hold if z is real (> 1), for the left-hand side is then increasing in M and 
hence converges. Hence, it also holds for complex z, by taking absolute value. 
It now follows that ()2.13p will be proved if we show that 

Xa;, ^W(S)) dr = 1 + ^^(.^) _ cxp[-zxB{x)] 



z lim 

n— foo , 
nGN "'0 



-, w> 1 (2.14) 



(consider first real z, then complex). Making the change of variable r/{xB{x)) >—>■ 
r and then substituting from (J2.6I) . we obtain 



V(a;,ZffM)d^ 



xB{x)Y, e--^^(-)Ml + ^ 

7 = 1 "'j-1 ^ k=l 



x^[r-{k-l)f 



k\ 



Ar 



xB{x) 



r-ri+l-j 



-zxB{x)r X'['r-{i -'^W 



dr 



^xB{x)\ \ e-""^(")'"dr + ^ / 



zxB{x-){r+^-\) -^ ' j^ 



j! 



We now fix M e N (arbitrarily large) and let n > M, to be sent to oo. A 
straightforward calculation, using the gamma function, gives 



E 

j=M 



n+l-j 



'ZxB(x){r+j-l) • 



■ dr 



< 



1 



x[uB{x)]'^ 



E 

3=M-1 



^ — uxB{x) 

uB{x) 



Moreover, it follows immediately from the definition of B{x) that e "^^(^) < 
uB{x) (for M > 1). Hence, from the arbitrariness of M, we conclude that 



lim 



->(a 



xB{x) 



cB(x) 



\ dr 



/"CO QQ I'OO j j ^ 

/ e'^^^^^^'-dr + V / e-^^^(")('-+J-i)^drl, 



10 



provided that the infinite sum converges; indeed, it converges whenever |e '^^^^^'^ /{zB{x))\ < 
1 , which is clearly implied by u > 1 . The last assertion follows from the equality 



j! x[zB{x)]^ 



-zx 



B{x) 



3-1 



zB{x) 

which, in turn, leads straight to p.l4p . Hence (|2.13p is proved. Thus, in order 
to prove (|2.12l) (and hence the theorem) using the Wiener-Ikehara theorem, it 
suffices to show that the analytic function 

^("'") = ^ + zB{x)-eM^zxB{x)] " 7^' " > ^' ^^'^''^ 

has a boundary function g(l + iv;x) in the following sense. For u J, 1, the 
function g^ivv^x) := g{u + iv]x) converges to 17(1 + iv]x) uniformly on every 
finite interval {— / < v < I}. This is clearly satisfied for {6 < \v\ < I}, for 
arbitrarily small d > 0. The uniform convergence on {^l < v < 1} will thus be 
established upon showing that g{u + iv; x) has a limit as u J, 1 and u — )■ 0. In 
fact, reducing the fractions in (|2.15p to a common denominator and applying 
I'Hopital's rule twice (at z = 1), we find that 

, . , , x^B{x) 
lim g{u + iv;x) = 1 + -— ^, ,.,„ . 

The theorem is thus established. ■ 

It is instructive to note here the following remark. 

Remark 2.3. Suppose that h{u) := u J^ e"^^ h{r) dr — the 'Laplace-Stieltjes 
transform' of a function h — exists for u > 0, and that lim„^o h{u) ~ c. Under 
a suitable condition ( Tauberian condition) on h, limj._>.oo h{r) — c (Final Value 
Theorem). When applied to the function A{x,r), x fixed, this yields (I2.10p (as 
follows readily from (|1.8p ). However, it might be difficult to show that A{x,r) 
fulfills a required Tauberian condition for the validity of this application; cf., 
e.g.. Theorem 1.7.6 in Bingham, Goldie, & Teugels [T] (Karamata's Tauberian 
Theorem; extended form). The Wiener-Ikehara theorem, which is a complex 
Tauberian theorem, provides a convenient way to establish the desired result; 
an alternative is indicated in the last paragraph of Appendix A. 

Since the function ip{x, r) plays a crucial role in this paper, it is worth making 
the following easy observation. 

Remark 2.4. The function ip{x, r) — as well as its asymptotic expression A{x)e^^'^^^^ 
as r — > cxD — satisfies the differential equation 

d 

-—ipix, r) = xip{x, r — 1), r > 1. 
or 



11 



In view of (I2.9P and Theorem 12.21 it is natural to define and study tfie 
following approximation, which is designed for large values of t/ s: 

<j)*{s, t; A) := A(As)e^^[i-^(^^)l* w (j}{s, t; A). (2.16) 

(It indeed deserves to be denoted with a '•'.) As we shall see in the next 
section, the approximation (j2.16p may be effective already for t larger than a 
few multiples of s, and, moreover, even in the entire range of interest {t > s}. 

To conclude this section, recall the function 02(s,t; A) given by (II. 9p . From 
(|2.16p and relation (|1.2p . it follows straightforwardly the following approxima- 
tion (appropriate for s and t as above): 

Ms, t; A) w A(As)B(As)e-^[i--«(^")l* 

^ ^ -\\1-B{\s)]t 

1 + \sB{\s) 
This establishes the interesting relation (j)2{s,t\X) ~ i3(As)(/)(s,i; A). 

3 Numerical results 

In this section we demonstrate, in various ways, how extremely successful the 
approximation (|2.16p is. We first note that since 

0*(s,i;A)-0(s,i;A) = [A{Xs) - A{Xs,t/s)]e-^^^-^'^^'^^\ (3.1) 

we will pay particular attention to the difference between A{x) and A{x^r). 
However, the following point should be kept in mind in the sequel. On the one 
hand, as is seen from Table [5] and Figure [51 the quality of the approximation 
A{x, r) « A{x) reduces as x increases. On the other hand, the larger x (= As) 
is, the closer the negative exponential term in p.ip will be to 0. 

We begin by considering the following example, referring to p.4p . 

Example 3.1. We have implemented the computation of (j){s,t;X) (Eq. ()1.3|) ) 
and (/)*(s,i;A) (Eq. ()2.16|) ) in the Java language. Since, as we shall see later, 
(j){s,t;X) and (f)*{s,t;X) are extremely close to each other for large values of 
t/s, one may study the convergence (|2.4p based on </>* values. As a concrete 
example, we let A = 1, s = 1/v^, and t = y/r, for r very large. By (|2.4p . 
(j){s,t; A) —^ e~^ as r — >■ oo. When r was set to m • 10'', for m = 8, 9, and 10, 
the computational time for </> was too long: approximately 11.1, 14.0, and 17.3 
minutes, respectively. The (f>* values, on the other hand, were computed in 'no 
time'. Computation of (f)*{s,t;X) — e~^ for various values of r indicated that 
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Tabic 2: Tabulated rounded values of A{x, r) — A{x) for various values of x and 



X 






r 








10 


25 




50 


100 


0.010 


+6 • 10-33 


-7 


10-^5 


+1- 


10-148 


+ 9 - 10-293 


0.025 


+3 • 10-28 


+1 


10-64 


+9- 


10-126 


-1 - 10-247 


0.050 


-7 • 10-25 


-4 


10-56 


-9- 


10-109 


+ 3-10-214 


0.100 


-9-10-21 


+7 


10-48 


-2 


10-92 


-9-10-182 


0.250 


+9-10-1^ 


-3 


10-37 


+4 


10-73 


+ 2 - 10-142 


0.500 


+2-10-13 


+1 


10-30 


+9 


10-59 


-2-10-115 


1.000 


-6-10-11 


+8 


10-25 


+9 


10-49 


+5 - 10-93 


2.500 


+7-10-9 


-5 


10-19 


+2 


10-35 


-8-10-7" 


5.000 


+1 - 10-^ 


-4 


10-15 


+8 


10-29 


-4 - 10-57 


10.00 


-9 - 10-6 


+4 


10-12 


+3 


10-24 


+8 - 10-46 


25.00 


-5 - 10-1 


-3-10-9 


-7 


10-18 


-3 - 10-35 


50.00 


+1 - 10-3 


+2- 10-^ 


-7 


10-15 


+3 - 10-29 


100.0 


+2 - 10-2 


-4- 10-6 


+2 


10-12 


-2-10-24 



0.01 - 



0.005 



-0.005 



-0.01 



- 


1 


D2/ \ 


1 1 1 


1 .■■■ 1 


1 








■■. ....■■■■■■■■■■■" De 










■-""":::::-" 


'■■■■: D5 


O7 




- 


1 


1 1 


"d: 

1- 1 1 





: 


1 1 


1 



1 



10 



Figure 2: The differences D^.{x) := A(x,r) — A{x), r = 1, ... ,7, for x ranging 
in [0,10]. 
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this difference is asymptotically l.he~^ /^ as r -> oo; this was confirmed by 
more accurate calculations, suitable for r as large as 10®", using WIMS function 
calculator (see below). 

By letting t ~1, s = 1/r, and A = -/r in Example l3.1[ we get its analogue for 
(j2.5p (note that 4>{l/r, 1; ^/r) — (?!>(l/y^, ^Jr; 1)). Everything else is unchanged 
from Example 13.11 

We also note that Monte Carlo approximation of (/)(s,i;A) — which is very 
easy to implement — could have been useful if we did not have the incomparably 
more powerful approximation (|2.16|) at our disposal. 

In order to demonstrate how extremely accurate the approximation (|2.16|) 
is, we need a powerful computing tool. We take this opportunity to recommend 
WIMS 'Function calculator', a module under the WWW Interactive Multipur- 
pose Server (WIMS), available, for free, at 



http://wims.unice.fr/wims/wims.cgi?module=tool/ analysis/function. en 



Using WIMS function calculator, we calculated the difference between A{x, r) 
and A{x) (recall ([XT]) ) for various values of x in [0.01, 100] and r in [10, 100]. The 
results are tabulated in Tabled The A{x) values were obtained effortlessly from 
the definition (|2.7p : however, in order to obtain the A{x, r) values (from (|2.8p ). it 
was necessary to enter the function (/^(x, r) manually. From Table[2l it is evident 
that A(x, r)~ A{x) tends to very rapidly as r — >■ oo, uniformly for x in bounded 
intervals [0, a]. This confirms our claim about the high quality of approximation 
(|2.16p , recalling, moreover, the last sentence in the first paragraph of this section. 

Remark 3.1. It should be pointed out that the fact that r is integer in Table 
[2] has no intrinsic significance. For example, the following rounded values of 
A{x,r) - A{x) were obtained (cf. Table E]): +5 • lO'^^ for x = 0.01, r = 10.3; 
-8- 10-48 for X = 0.10, r = 24.8; -1 • lO^-^^ for x = 1.00, r = 50.2; and -7- lO"-*® 
for X = 10.0, r = 99.7. 

Recall from Example 13.11 that we have implemented the computation of 
(j){s,t;X) (Eq. (|1.3p ) in the Java language. In this setting, a significant draw- 
back of Eq. (|1.3p (in addition to that indicated in Example 13. ip is that the 
term ak '■— A'^[t — (fc — l)s]''/k\, assuming it is a double-precision number, 
would return the value Infinity if it overflows the value of Double .MAX_VALUE, 
which is approximately 1.8 • lO'^"^. (We note that a^ should be computed as 
Y[i=i ^[t ~ {k — l)s]/i, in order to avoid potential Infinity in the numerator or 
denominator.) As an example, if we set s = 0.1, t = 500, and A = 2, then ak 
yields Infinity for 376 < k < 1431; hence, ^(s, t; A) would not be computed. On 
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Table 3: Tabulated rounded values of Sn{s, t; X) — (f)* {s , t; X) , N — 3,. . . , 7, for 
s = 10"^, t = 1, and A = 100, 200, and 400. 



A 






N 








3 


4 


5 


6 


7 


100 


+5 • 10-*^ 


-1- lo-'^ 


+2-10-9 


-4- 10-" 


+5-10-13 


200 


+1 • lo--'' 


-9-10-5 


+7-10-*^ 


-4- lO-'^ 


+2 - 10-8 


400 


+2-10-1 


-7-10-2 


+2 - 10-2 


-5-10-3 


+1 - 10-3 



the other hand, ^*(s,i;A) has been calculated, according to (|2.16p . to a very 
high precision (validated by comparison with WIMS). Moreover, changing t in 
this example from 500 to 400 leads to the incorrect value for (j>{s,t; A) (using 
Java), whereas, by comparison with the value obtained for 0*(s, i; A) (using Java 
or WIMS), the correct value should be approximately 1 - IQ-^**. 

At this point, it is interesting to compare the approximation (j2.16p with its 
counterpart (|2.ip for the case when t is fixed and s is small. For this purpose 
and for later use, we shall consider the corresponding complements. We define 

5^(s,t;A) = f](-l)"+iF„(At)^, (3.2) 

?i— 1 

viewing it as a function of s. By (|2.1|) . 

0(s,i;A) =5Ar(s,i;A)+0(s^+i) as s I 0. (3.3) 

We also define (l)*{s,t; X) = 1 — (/)*(s,i;A). We first recall from Example 13.11 and 
the paragraph that follows it that the exact formula (|1.3p might be impractical 
when r = t/s is very large, due to very long processing time. Thus p.3|) has an 
obvious merit over (jl.3p in the asymptotic setting as s J, 0, since Sj\j, like (f)*, is 
computed in 'no time'. However, as one would expect from (j3.2[) and Table [TJ 
TV should be chosen with care when A is relatively large (especially if so is t), 
depending on the accuracy we wish to achieve. Hence the general superiority of 
4>* over Sn in approximating (as s 4, 0). This point is demonstrated in Table 
Hand Figure El 

Having considered the case where t/s is large — the important case in our 
context — we now claim that the approximation (12.161) may be effective already 
for t larger than a few multiples of s, and, quite surprisingly, even in the entire 
range of interest {t > s}. This is apparent from Figure [21 combined with p.l|) . 
for the case where (r =) t/s = 1, ... ,7. As in Remark 13.11 we note that the fact 
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0.001 0.002 0.003 0.004 0.005 0.006 0.007 0.008 0.009 0.01 

Figure 3: The first seven polynomials Sn{s, t; A) for i = 1, A = 10, and s ranging 
in [0,0.01]. 

that r is integer in Figure [5] has no intrinsic significance. This point is demon- 
strated in Figure m showing that in the case s = 1, A = 1 a quite satisfactory 
approximation is obtained uniformly for t > 1. Much more impressive approx- 
imations can be obtained by decreasing or (especially) increasing A, as well as 
by decreasing or increasing s. We thus reveal another important advantage of 
approximation (|2.16|) over the exact form (|1.3p . namely, the simple structure of 
the former when (/)*(s,i; A) is viewed as a function of t (> s). For example, by 
(jl.ip . the distribution function of the waiting time t(s; A) until two events of N 
occur within s time units can be approximated, for t > s, as 

P(r(s; A) < t) « ^*(s, i; A) = 1 - A(As)e-^[i-^(^^)l*. 

(Note the resemblance to an exponential distribution function.) 

Appendix A: 

Comparison with Gates and Westcott's result 

In this appendix we elaborate on the point made in the second paragraph of 
Section 1 regarding the fundamental result of this paper (namely, Theorem l2.2p . 
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0.003 
0.002 - 
0.001 - 


-0.001 h 

-0.002 

-0.003 
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-0.006 




1 1.2 1.4 1.6 1.8 2 2.2 2.4 2.6 2.8 3 



5e-05 
4e-05 - 
3e-05 - 
2e-05 
le-05 




-le-05 



"1 1 1 1 1 1 1 1 r 



#2 - 



3 3.2 3.4 3.6 3.8 4 4.2 4.4 4.6 4.8 5 

Figure 4: The difference 4>*{s, t; A) — (j){s, t; A) for s = 1, A = 1, and t ranging in 

[1,3] (#1) and [3,5] (#2). 

In our notation. Gates and Westcott j3], who considered the distribution 
of the scan statistic YAr(s,i), gave the fohowing expression for the function 
(p(s,i;A) =e^V(s,t;A) ,3, Eq. (2.3)]: 



(p(s,i;A) 



exp[p{t + s)] 



sp 



+ exp(pi)o(l) as t — > oo. 



(A.l) 



where p = p{X,s) is the sohition of pexp(sp) = A. It is readily checked that 
the solutions p(A, s) and B{Xs) are related by p{X, s) = XB{Xs) and, in turn, 
that (jA.l| is equivalent to ()2.11|) . However, our presentation is evidently more 
elegant and instructive. 

Much more importantly. Gates and Westcott underestimated the accuracy 
and validity of the approximation. They considered it "extremely accurate 
(typically to six or seven significant figures) even for small L" , where L is i 
in our notation, and illustrated its accuracy in Table 1 of their paper. How- 
ever, the values tabulated in that table are relatively inaccurate and mislead- 
ing, the most notable example being the values 0.5439808 and 0.5439807 for 
^(1, 20; 0.2) and 4>*{l, 20; 0.2), respectively, whereas using WIMS we have found 
that (j){l, 20; 0.2) - 0*(1, 20; 0.2) « 9.7 • lO^^s. As for the validity of the approxi- 
mation. Gates and Westcott apparently did not realize that the approximation is 
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valid for large values of t/ s in general, and in particular in the asymptotic setting 
as s 4 0; their Table 1 presents, in addition to the approximation of (/)(!, 20; 0.2), 
approximations to 0(1, 20; 0.3) and (/)(!, 10; 1). That table also presents the cor- 
responding values of Naus's approximation mentioned in Section 1; roughly, the 
errors from the exact (f> values are of order 10~^. 

A final note is in order concerning the derivation of (|A.1[) by Gates and 
Westcott. In our notation. Gates and Westcott noted from Penrose and Elvey 
[B] that the Laplace transform of (/j(s, t; A), as a function of t, is given by 
' 1 A 

-"^^^(-'^^^)-^^- n^[l-Aexp(-..)/.] (^-2) 

(say, for all z satisfying Re(z) > A), and that inverting it by residues gives 
(jA.ip . However, justifying the second part of this statement rigorously might 
be a little inconvenient, especially for one who is not familiar with the calculus of 
residues. The delicate point is that the right-hand side of ()A.2|) . when extended 
to a meromorphic function on the whole complex plane, has infinitely many 
poles. (In this context, see sections 3 and 4 in Penrose and Elvey, where a 
somewhat more general case is treated.) In this respect, application of the 
Wiener-Ikehara theorem is a good alternative. 

In any case, Gates and Westcott 's result has not received its due recognition. 

Appendix B: 

The Wiener— Ikehara theorem 

For the convenience of the reader, we quote Theorem 4.2 from Korevaar [4]. 

Theorem B.l (The Wiener-Ikehara theorem). Let S{t) vanish for t < 0, be 
nonnegative and nondecreasing for t > 0, and such that the Laplace-Stieltjes 
transform 

/•OO /-OO 

f{z) ^ £dS{z) = / e"^* dS{t) = z / S'(i)e-'=* di, z = x + iy, 
Jo- Jo 

exists for Re(z) = a; > 1. Suppose that for some constant A, the analytic 

function 

g{z) = f{z) -, a;> 1, 

z — 1 

has a boundary function 5(1 + vy) in the following sense. For x \, \, the function 

g-xiSy) ~ g{x + iy) converges to g{l + iy) uniformly or in L^ on every finite 

interval — A < y < A. Then, 

e~*S'(i) -> A asi— ^00. 
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Abstract 

> 

fT^ ' In this addendum, we further discuss numerical issues concerning the 

OO ■ computation of the probability denoted 0(s,i; A) in ,1,. In particular, and 

most importantly, we correct the naive claim made in the abstract of [T] 
that the explicit expression of 0(s, t; A) becomes intractable for very large 

^— «, . values of t/s; rather, we show that it may not be applicable for large 

f^ , values of At. 

We use the notation of pT. At first glance, the expression on the right of 



[H Eq. (1.3)] seems intractable for very large values of t/s. In some respects this 
impression is correct (comparing to the approximations given in [T]). However, 
in order to reveal a significant practical drawback of (|A.ip . we should rather 
turn our attention to large values of Xt. To exemplify this claim, we suppose — as 
in [I] — that the computation of (j){s, t; A) is implemented in Java (using double 
precision) . In this setting, the value of an expression is Infinity if it overflows 
the value of Double .MAX_VALUE, which is approximately 1.8 • 10^"^. Thus, the 
term a^ := X''[t — (fc — l)s]'^/fc! should be computed with some care, say as 
Ofe = 11^=1 -^[^ ^ (^ ~ l)'^]/*- Doing so, no overflow problem would arise as long 



as Xt (rather than t/s) is not too large. Indeed, since ip{s, t; A) is a probabihty, 
the expression muhiplying e^'*'* on the right of (lA.ll) is bounded from above 
by e"^*; so are the partial products of a^, since Hi^i -^[^ — {^ — ^)s]/i < {Xty /j\ 
(1 < J < k). Thus, the condition Xt < c* where c* = ln(Double.MAX_VALUE) w 
709.78 guarantees the applicability of (jA.l[) . (It should be noted here that 
Double. MIN_VALUE = 4.9 x lO"^^'^ < e"'^'^"'.) For larger values of At, the story 
is quite different. Suppose that Ai > c*. Then, in view of (|A.ip . an overflow 
problem wiU occur if (/)(s,i; A) > p* where p* = e^'^*Double.MAX_VALUE. (For 
example, if At = 716, then p* k, 2- 10^'^.) In order to mitigate this problem, we 
first rewrite (|A.1I) as follows: 

0(s, t- X) = c-^* + ^ n exp(-At/fc)^i^^-^^^^. (A.2) 

fc=i 1=1 

As one would expect, this representation allows for substantially larger values 
of At than previously. However, the improvement is not drastic, as we now 
exemplify. Let r = 4 x 10®, and set s = l/\/r, t = y^, and A = 1. Hence, 
At = 2000. The approximation 1, Eq. (2.16)] 

(/.^(s, t; A) := A(As)e-^[i--^(^'')l* (A.3) 

of 0(s,t;A) gives us 0(s,t;A) = 0.36815525... (in agreement with the estimate 
e""'^ + 1.5e~^/^/r = 0.36815535... indicated in [IJ Example 3.1]). Truncating the 
sum in (|A.2p at k = 1942 contributes less than 11% to the value of (f>{s,t;X); 
the 1943th term, on the other hand, contributes Infinity. (Hence (jA.2p is useless 
here.) To understand this phenomenon, consider the partial products of the fcth 
term: 

n(j;fc):=nexp(-AtA)^I^^i^^iM, i < , < fc. 

While, theoretically, \i(k\ k) is eventually small, it might reach Infinity on the 
way. In our example, the theoretical value of H(1943; 1943) is approximately 
0.00155 (found by (IXi]) . below) but H(699; 1943) = Infinity; this resuhs in 
H(1943; 1943) (and hence also 0(s, t; A)) being equal to Infinity. See also Figure 
[TJ It is thus natural to consider the following modification of (|A.2p : 

ft/si k 

0(.,t;A) =e-^*+ 5: nexp(-At/fc)^i^^-i^-iM. (A.4) 

k=l 1=1 ^ ' 

However, it turns out that computing 0(s,t;A) this way does not lead to an 
essential improvement, if any. 




800 1000 1200 1400 1600 1800 2000 



Figure 1: Plot of the function lnn(j;fc), k = 1942, j = 1, 



Here 



maxj lnn(j; k) w 709.56, just below the permitted maximum of c* sa 709.78. 



Having partially resolved the computational overflow problem of (jA.ip . we 
now turn to consider the computational time problem indicated in [Ij Example 
3.1]. For the purposes of discussion, we assume that 0(s, t; A) is computed using 
(jA.ip and, of course, that t/s is very large; in view of the preceding discussion, we 
assume that Xt is not too large. In particular, we assume that At is substantially 
smaller than t/s; equivalently, As is substantially smaller than 1. (In view of 
(jA.6p . this restriction is very mild.) In this setting, the aforementioned problem 
is resolved by the following observation. For any a > fixed and At sufficiently 
large (depending on a), 



<)0(s 


t;A)- 


e-^* 


>; 

\k-\t\<a 


A'=[f- 
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\k-Xt\>aV\i, 


fe>0 











(A.5) 



where $ denotes the distribution function of the N(0, 1) distribution. The ap- 
proximation on the right follows by a straightforward application of the central 
limit theorem. (The basic idea is that if Xi are i.i.d. Poisson(l) and At is an 
integer, then J2i=i-^i is Poisson(At).) It is very rough, as one can conclude 
by carrying out the proof, but still provides a satisfactory order of magnitude. 
The implication of (jA.sp is that (/)(s,t;A) can be approximated very well by 
summing a relatively small number of terms that dominate the calculation. As 
an example, let s = 1/400, t = 400, and A = 1. (Hence, 0(s,t;A) « 0.369.) 



Approximating (j){s, t; A) according to (|A.5I) with a = 5 (hence, 199 terms are 
taken) gives an error of approximately 2.24 x 10^^, in agreement with the hy- 
pothetical upper bound 2(1 — $(5)) ~ 5.73 x 10~^. Increasing a to 6 (hence, 
239 terms are taken) gives an error of approximately 1.07 x 10~^, in agreement 
with the hypothetical upper bound 2(1 — <1'(6)) ~ 1.97 x 10^^. As a further 
example, let s = 10"^, t ^ 10^, and A == 0.4. (Hence, 0(s,i;A) w 0.852.) Ap- 
proximating (f>{s, t] A) by summing from k = 190 to 650 in (|A.ip . gives an error 
of approximately 5.77 x lO"'^^ (confirmed, using WIMS "Function calculator", 
by comparison with the value of 0*(s,t;A)). We conclude that the apparent 
computational time problem indicated in jl] Example 3.1] is simply resolved by 
truncating the sum in (lA.ip according to (|A.5|) . 

We take this opportunity to make the following remark. 

Remark A.l. Extending Remark 1.1 of [I], we point out that depends on s, 
t, and A only through As and Xt = Xs{t/s). In particular, we have that 

(j){s, t; A) = <f){Xs, Xt; 1) = 0(1, t/s; As). (A.6) 

The new representation, 0(s, t; A) — 4>{Xs, Xt; 1), is instructive in view of (|A.3p . 
(|A.5|) . (|A.7|) . and T, Eqs. (2.4) and (2.5)]. Moreover, it shows that the com- 
putational problem indicated in the first paragraph cannot be simply resolved 
by some transformation of the parameters. On the other hand, the representa- 
tion 0(s, t; A) = 0(1, t/s; As) arises from the setting of Theorem 2.2 of [1] — the 
fundamental result of the paper. 

Suppose that (|A.ip (including its variants, (IA.2I) and (jA.4p ]) is not applicable 
due to a large value of Xt. If As • Xt is smaller than 1 or so (the smaller the 
better), then 0(s, t; A) can be approximated fairly accurately by the asymptotic 
result [t Eq. (2.1)] 

N 

(/)(s,i;A)-y (-l)"P„(At)^^ = 0(s^+i) assiO. (A.7) 

n=Q 

(The larger N is, the better the approximation.) However, such approximation 
is inferior to the outstanding one (|A.3p . This point is illustrated in Table [1] (cf. 
also [I] Table 3] ) . For convenience of implementation, it pays to rearrange the 
expression for P„(x) [1] Section 2] as follows: 



J=0 






(where S{i,j) denote the Stirling numbers of the second kind). Finally, it should 
be stressed that (JA.7P is not effective when As • At is substantially greater than 



Table 1: Tabulated rounded values of (j}*is,t;X) - J2n=Q (-!)"-?« (Ai)A"s"/n!, 
^ = 6, . . . , 10, for A = 1, i = 10*, and s = IQ-*, 10-^ and 10"^ 

~s N 

6 7 8 9 10 

10-* -1.8- 10-* +2.2- 10-5 -2.5-10-6 +2.6- 10"^ -2.3-10-*^ 

10-*^ -2.0-10-18 +2.5-10-21 -2.8-10-24 +2.8 - 10-^^ -2.5-10-^° 

10-*^ -2.0-10-32 +2.5-10-37 -2.8-10-42 +2.8 - lO'^^ -2.5-10-^2 



1, say greater than 3. In this case, and in general when Xt is large, a satisfactory 
approximation to (f>{s, t; A) may be obtained simply by the asymptotic estimate 
g^A^st (cf. 1, Eqs. (2.4) and (2.5)]). For example, if As = 10-^ and Xt = 4x 10^, 
then (j){s,t;X) - e-^'"* « 1.1 x 10-*; if As = 10-^ and At = 4 x 10^ then 
^(s,<;A)-e-^'^*« 1.1 X 10-*^. 
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